SORTING USING COMPLETE SUBINTERVALS AND THE 
MAXIMUM NUMBER OF RUNS IN A RANDOMLY 
EVOLVING SEQUENCE. 

SVANTE JANSON 

Abstract. We study the space requirements of a sorting algorithm 
where only items that at the end will be adjacent are kept together. 
This is equivalent to the following combinatorial problem: Consider a 
string of fixed length n that starts as a string of O's, and then evolves by 
changing each to 1, with the n changes done in random order. What 
is the maximal number of runs of l's? 

We give asymptotic results for the distribution and mean. It turns 
out that, as in many problems involving a maximum, the maximum is 
asymptotically normal, with fluctuations of order n 1 ^ 2 , and to the first 
order well approximated by the number of runs at the instance when 
the expectation is maximized, in this case when half the elements have 
changed to 1; there is also a second order term of order n 1//3 . 

We also treat some variations, including priority queues. 

The proofs use methods originally developed for random graphs. 



1. Introduction 

Gunnar af Hallstrom pQ considered, as indicated at the end of his paper, 
the following algorithm for sorting an unordered pile of student exams in 
alphabetic order. (It is said that he used this procedure himself.) 

The exams are taken one by one from the input. The first exam is put in a 
new pile. For each following exam (x, say), if the name on it is immediately 
preceding the name on an exam y at the top of one of the piles, the new exam 
x is put on top of y. (The professor knows the names of all the students, 
and can thus see that there are no names between x and y.) Similarly, if the 
name on x is immediately succeeding the name on an exam z at the bottom 
of a pile, x is put under z. If both cases apply, with y on top of one pile 
and z at the bottom of another, the two piles are merged with x inserted 
between z and y. Finally, if there is no pile matching x in one of these ways, 
x is put in a new pile. 

The algorithm thus maintains a list of sorted piles, each being an interval 
without gaps of the set of exams. At the end, there is a single sorted pile. 
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The problem is the space requirement of this algorithm; more precisely, 
the maximum number of sorted piles during the execution. The input is 
assumed to be in random order, so this is a random variable, and we are 
interested in its mean and distribution. 

Remark 1.1. As a sorting method, this algorithm has drawbacks. First, it 
requires that all names are known from the beginning; mathematically it can 
be seen as sorting the numbers 1, . . . ,n. Secondly, the space requirement 
turns out to be quite high, see below. This also implies that the number 
of comparisons necessary for each insertion is high, of the order of n. The 
algorithm might be useful when blocks of sorted items can be manipulated 
as easily as individual items, and we do not want to make insertions inside 
the blocks, for example when sorting physical objects that are to be glued 
together in order. 

af Hallstrom [lj gave the following mathematical reformulation, where we 
also introduce some notation. Consider a deck of n cards numbered 1, . . . , n 
in random order, and a sequence of n places with the same numbers in order. 
Take the cards one by one and put them at their respective places. When we 
have placed m cards, < m < n, we see X n ^ m "islands", i.e. uninterrupted 
blocks of cards. What is X* := max m X n m ? 

Alternatively, we can use the language of parking cars, which is popular 
for some related problems in computer science: n cars park, one by one, 
on n available places along a street; each car parks at a random free place. 
What is the maximum number of uninterrupted blocks of cars during the 
process? 

Let, for n > 1, < m < n and 1 < k < n, the indicator I n ,m(k) be 1 if 
the item (exam or card) with number k is one of the m first in the input, 
and otherwise. Thus, X n ^ m is the number of runs of l's in the random 
sequence I„ jm (l), . . . , I„ im (n) of n — m O's and m l's. We can express X n ^ m 
algebraically as 

ti-1 

X n) m = In,m(X) + ^ ] (1 — ^n,m(^))^n,m(^ + 1) 
fc=l 

ra-1 

= m- } y In,m{k)l n ,m{k + 1). (1.1) 
k=l 

If the input is given by the permutation a of {1, ... , n}, so that item k 
has position a~ 1 (k), 

where 1[. . .] denotes the indicator of the indicated event. We assume that 
a is a (uniformly chosen) random permutation; thus so is cr _1 . Hence, each 
random sequence {I n ,m{k))^ =l is uniformly distributed over all (^) possibil- 
ities; moreover, for each m < n we obtain (I n ,m+i(k))^ =1 from (In,m(k)) k l = i 
by changing a single randomly chosen to 1, this random choice being uni- 
form among the n — m O's, and independent of the previous history. 
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It is easy to see that EX n>m = m(n — m + l)/n, see (j3.ip : it follows that 
the maximum of El„. m for a given n is attained for m = |~n/2] , and that 
M„r„/2i > n/4. Since obviously 

EI* =E max X nm > max E X n m , (1.2) 

m ' m 

this yields EI* > n/4 as observed by af Hallstrom pQ. Moreover, he ob- 
served that EI* is subadditive, and thus the limit 

7 := lim EX*/n 

exists and equals inf n E A*/n; he further showed that 1/4 < 7 < 1/3, where 
the lower bound comes from (jl.2p . Based on simulations with n = 13 and 
n = 52, af Hallstrom pQ concluded that 7 seems to be very close to or 
equal to 1/4. We will show that, indeed, 7 = 1/4. We also show that the 
distribution of I* is asymptotically normal, with a variance of order n. 

Theorem 1.2. As n — ► oo, 

n- l l\X* n - n/4) ^N(0, 1/16), (1.3) 
mi/i convergence of all moments. In particular, 

EX*=n/4 + o(n 1/2 ), 
Varl* =n/16 + o(n). 

This theorem says that to the first order, the maximum number of piles 
(runs) X* behaves like the number X n ^ m with m = |~n/2~|. A more refined 
analysis shows that the difference X* — X n r n /2\ is of order n 1//3 . Let B(t), 
— oo<i<oo, be a standard two-sided Brownian motion; thus -B(O) = 
and B{t), t > 0, and B(—t), t > 0, are two independent Brownian motions. 

Theorem 1.3. As n — > 00, 

n- 1 / 3 ^-!,,,^)^^, (1.4) 

where the random variable V is defined by V := max^ (B(t)-t 2 /2), and 

EX* = EX n>H2] +\EVn 1 ' z + o{n 1 ^) = \n + \EVn 1 ^ + 0^). 

The random variable V is studied by Barbour |2j, Daniels and Skyrme 
[7] and Groeneboom [10J. Note that < V < 00 a.s. We have, see [7] 
(using Maple to improve the numerical values in [21 [3j 13 E]), with Ai the 
Airy function, 

2 -l/3 roo j d 

EF = / . sa 0.996193. 

2tt 7_ 0O Ai(iy)2 

The numerical values X* 3 ~ 4.22 and X52 ~ 14.66 found experimentally 
by af Hallstrom [l] differ from n/4 by about 18% and 10% less than the 
correction term ^ E 7k 1 / 3 in Theorem 11.31 which is a reasonable agreement 
for such rather small n. 
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Remark 1.4. af Hallstrom pQ considered also the cyclic case, when we 
regard {1, . . . ,n} as a circle, which sometimes is slightly simpler to study 
because of the greater symmetry. In this case we define I n ,m(k) for all k G Z 
by I n ,m(k + n) := I n ,m(k), i.e. we interpret modulo n, and we sum to n 
in (II. ip . Since the number of runs in the linear and cyclic version differ 
by at most 1, all our asymptotic results remain the same, and we will only 
consider the linear case. (Moreover, the cyclic case with n items corresponds 
exactly to the linear with n — 1 by fixing the last element, see pQ.) 

We prove these theorem by studying asymptotics of the entire (random) 
process (X nim )^ =0 . The natural time here is m/n, so we take m = \nt\ 
for < t < 1 and consider the process X n i with a continuous parameter 
t E [0,1]. The following theorem shows that this process asymptotically is 
Gaussian. (The space D[0, 1] is defined in Section HI see [J| for a detailed 
treatment.) 

Theorem 1.5. As n — > oo ; in the space D[0, 1] of functions on [0, 1], 

n-^ 2 {X n<[nt ^ - nt(l - t)) ^ Z(t), (1.5) 

where Z is a continuous Gaussian process on [0, 1] with mean E Z(t) = 
and covariances 

E(Z(s)Z(t)) = s 2 (l-t) 2 , 0<s<f<l. (1.6) 

The behaviour of X* shown in Theorems 11.21 and 11.31 with an asymp- 
totic normal distribution with a mean of order n and random fluctuations 
of order ra 1 / 2 , and with a second order term for the mean of order n 1 / 3 , 
is common for this type of random variables defined as the maximum of 
some randomly evolving process. For various examples, both combinatorial 
and others, and general results see for example Daniels [5j[6], Daniels and 
Skyrme [7], Barbour [21 [3] and Louchard, Kenyon and Schott [20] . Indeed, 
paraphrasing the explanations in these papers, in many such problems, the 
first order asymptotic of a random process X n (t) (after suitable scaling) is 
a deterministic function f(t), say, defined on a compact interval / (typically 
scaled to be [0, 1] as here). Hence the first order asymptotic of the maxi- 
mum of the process is just the maximum of this function /. Moreover, it is 
often natural to expect that the random fluctuations around this function 
/(t) asymptotically form a Gaussian process G(t); this is then a second or- 
der term of smaller order as in our Theorem 11.51 If we assume that / is 
continuous on / and has a unique maximum at a point to £ Ii then the 
maximum of the process X n (t) is attained close to to, so the first order ap- 
proximation of the maximum is the constant f(to) = maxt/(i), while the 
next approximation is just X n (to), giving a normal limit law as in our Theo- 
rem [L2l The Gaussian fluctuations in this limit have mean 0, so in order to 
find the next term for the mean El*, we study more closely the difference 
maxt X n (t) — X n {to) by studying the difference X n (t) — X n (to) close to to- 
Assuming that to is an interior point of I and that / is twice differentiable 
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at to with /"(to) 7^ 0, we can locally at to approximate / by a parabola 
and G(t) — G(to) by a two-sided Brownian motion (with some scaling), and 
thus maxt X n (t) — X n (to) is approximated by a scaling constant times the 
variable V above, see Barbour [2] and, in our case, Corollary 14.51 below. In 
the typical case where the mean of X n (t) is of order n and the Gaussian 
fluctuations are of order n 1//2 , it is easily seen that the correct scaling gives, 
as in Theorem [L3] above, a correction to EX* of order n 1 / 3 , see [2j[5j[6] and 
Section El 

The method used in the present paper is a simple adaption of the method 
used in [13] and [H] to study the number of subgraphs of a given isomor- 
phism type in a random graph. These papers study the random graphs 
G(n,p) and G(n,m) that can be constructed by random deletion of edges 
in the complete graph K n (with the deletions being independent for G(n,p) 
and such that a fixed number of edges are deleted for G(n, m)). The method 
applies more generally to random graphs constructed by random edge dele- 
tions in these ways from any fixed initial graph F n . The problem treated in 
this paper can be regarded as an instance of this when the initial graph is 
the path P n with n edges. In particular, Theorem 11.21 corresponds to [141 
Theorem 24] , which gives the asymptotic distribution of the maximum num- 
ber of induced subgraphs of a given type during the evolution of G(n,p) or 
G(n, m); see also [HI Theorem 33] (isolated edges) and [HI Theorem 17] (a 
general result) for related results. Conversely, we expect that these results 
for random graphs can be complemented by the analogues of Theorem 11.31 
above, using the method of proof in the present paper, but we have not 
verified the details. 

Our method applies also to other problems. First, let Xn} m be the number 
of piles with a single exam (runs with a single 1) in the process studied above. 
Then we obtain similar results for the maximum X^* := max m J^i,, see 

Section [7J The same applies to the number X^ m of piles with any other 
fixed number d of exams (runs of a fixed length d). 

Another example is given by priority queues, where Louchard [18] and 
Louchard, Kenyon and Schott [20] have proved asymptotic results very sim- 
ilar to the Theorems I1.2H1.5I above. In particular, they found the same 
asymptotic covariance (|1 .6[) except for a normalizing constant. (See also 
Flajolet, Frangon and Vuillemin [8] and Flajolet, Puech and Vuillemin [9] 
for combinatorial results on generating functions involving Hermite polyno- 
mials; these results, however, do not easily yield asymptotics.) 

Priority queues can be defined as follows. Suppose that n items are to 
be temporarily stored (or processed); let item i arrive at time Ai and be 
deleted at time D{. We assume that the 2n times Ai and Di are distinct; 
thus they can be arranged in a sequence of the 2n events Ai and Di , with Ai 
coming before Di for each i. We assume further, as our probabilistic model, 
that all (2n)!/2 n such sequences are equally probable. Ignoring the labels, 
we can equivalently consider sequences of n A and n D (or n + and n — ), 
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where each A is paired with a D coming later; there is a 1-1 correspondence 
between such sequences and pairings of 1, . . . , 2n into n pairs, and there are 
(2n — 1)!! = (2ra)!/(2 n n!) such sequences (with pairings), again taken with 
equal probability. 

Let, for m = 0, . . . , 2n, Y n) m be the number of items stored after m of 
these events, i.e. the number of A's minus the number of D's among the 
m first events, and let Y* := maxo< OT <2n Yn,m- The sequence (V n ,m)o n 
is a Dyck path, but note that its distribution is not uniform; for a given 
Dyck path (or a given sequence of A and D without labels), the number 
of ways to pair a given D with a preceding A, i.e. the number of ways 
to choose which item to delete, equals the current number of items stored 
before this deletion. Thus, the weight of the Dyck path equals the product 
of these numbers Ff™v Y nm . Alternatively, which better explains 

the name priority queue, we can keep the stored items in a list showing the 
order in which they eventually will be deleted; then there is only one choice 
for each deletion but each new item can be inserted in Y + 1 ways if there are 
Y items stored before the insertion, and thus Y + 1 after it; hence the weight 
can also be written as IJm:y n m >Y n _i ^n,m- (It is easily to see directly that 
the two products are equal.) 

We will in Section [8] show how our method applies to priority queues, and 
explain why we obtain the same asymptotic results as for X n ^ m and X*. 
(Note that there is no exact correspondence for finite n, since the natural 
sample spaces have n\ elements for X nm but (2n — 1)!! elements for Y n rn .^ 
Again, we can regard the problem as an instance of subgraph counts for 
randomly deleting edges from a given initial graph F n ; in this case taking 
F n to be a multigraph consisting of n double edges. 

A third example is a model suggested by Van Wyk and Vitter [23] as a 
model for hashing with lazy deletion, and further studied by Louchard [19] 
and Louchard, Kenyon and Schott [20]. In this model, n item arrives and 
are deleted as above, but now the arrival and deletion times Ai and Di are 
random numbers, with the n pairs (Ai,Di) mutually independent and each 
pair distributed as (TjATj, Tj VTj), where T« and Tj are independent random 
variables uniformly distributed on [0,1]. (We use A and V as notations for 
min and max of two numbers.) We let Y n (t) be the number of items present 
at time t, and again we are especially interested in its maximum maxt Y n {t). 
Again, the asymptotic results for the maximum found by Louchard, Kenyon 
and Schott [20] are the same as in our Theorems 11.21 and 11.31 except for a 
constant factor, while the asymptotic result for the process Y n (t) found by 
Louchard [19J differs somewhat from the one in Theorem [L5j it corresponds 
instead to the one in Corollary 14.21 below. Indeed, as explained by Kenyon 
and Vitter [17], see also Section El this model can be seen as a priority queue 
with randomized times for insertions and deletions, which explains why the 
results for the maximum are the same as for priority queues. 
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We assume in the sequel that n > 2, to avoid some trivialities. All 
unspecified limits are as n — > oo. We use the standard notations — — > and 
— > for convergence in probability and distribution, respectively, of random 
variables, and a.s. for almost surely, i.e. with probability 1. 

2. Randomizing time 

We will use the standard method of randomizing the time. More pre- 
cisely, we let T±,... ,T n be independent random variables, each uniformly 
distributed on (0, 1). We interpret T k as the time item k arrives, and note 
that a.s. there are no ties. We define 

I(t;k) = l[T k <t], 

i.e., I(t; k) = 1 if item k has arrived by time t. We further define N n (t) as 
the number of items that have arrived at time t, and X n (t) as the number 
of runs of l's at time t, i.e., cf. (jl.lh . 



N n (t) = J2 I (t;k), (2.1) 
fc=i 

n-l 

X n (t) = I(t; 1) + 2(1 - /(t; k))l(t; k + 1) (2.2) 
fc=i 

n-l 

= # n (t)-2*(*; *)*(*;* + !)■ ( 2 -3) 
fc=i 

Clearly, the items arrive in random order, so the process remains the 
same except that the insertions occur at the random times Tr n .u, . . . , T( n;n ), 
where Tr n ,j\ is the j:th order statistic of T\, . . . , T n . We thus have I n , m {k) = 
I{T{n;m)\ k ) and X n{t) = X n , m when T (n;m) < t < T (n;m+1) (with T (n . 0) := 
and T( n . n _|_i) := 1 for convenience). In particular, 

X*= maxX n (t). (2.4) 

Note that X n (0) = X nfi = and X n (l) = X n ^ n = 1. 

The importance of this randomization is that the variables I(t;k), k = 
1, . . . , n, are independent (both for a fixed i and as stochastic processes, i.e. 
as random functions of t). For every n, k and t E [0, 1], 

F(l(t;k) = 1) =F(T k <t)=t, (2.5) 

i.e. I(t; k) has the Bernoulli distribution Be(t). X n (t) thus is the number of 
runs of 1 in a sequence of independent 0's and l's, each with the distribution 
Be(t). Furthermore, the number of items sorted at time t is N n (t) ~ Bi(n, t). 
Define further, for < t < 1, the centralized variables 

I'(t; k) := I(t; k) — E I(t; k) = I(t; k) - t (2.6) 
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Sn,i(t) := JV(t;fc) = N n (t)-EN n (t) = N n (t) - nt, (2.7) 
fe=i 

n-1 

S^(t) :=^l'(t;fc)l'(i;A; + l). (2.8) 



fc=i 



Thus S nA (0) = S„, 2 (0) = S„,i(l) = 5 n , 2 (l) = and E5„,i(t) = E5„, 2 (t) = 
for all t € [0,1]. We have 

n 

N n (t) = fc) + 1) = 5„,i(t) + nt, (2.9) 

fe=l 

n— 1 n— 1 



I(t\ k)I(t; k + l) = J2(l'(t; k) + t) (I'(t; k + l)+t) 

k=l k=l 

= 5„, 2 (t) + *(2S„,i(t) - J'(t; 1) - I'(t; n)) + (n - l)t 2 , 

and thus from (12. 3p the representation 

X n (t) = n(t - t 2 ) + t 2 + (1 - 2t)S n ,i(i) - S n , 2 (t) + 1) + n) 

= nt(l - t) + (1 - 2t)5 n ,i(t) - S„, 2 (i) + R n (t), (2.10) 

where i^(t) := t 2 + 1) + tl'(t; n) and thus |i?„(t)| < 3. 

We will in Section |4] study the asymptotic distribution of the stochastic 
processes (i.e., random functions) S nj i(t) and 5„ j2 (t); our main results then 
follow easily from (pTI|) and ([2TTD]) . 

Note that for any fixed i, the variables /'(i; fo) are independent and have 
means 0; hence the terms in the sums in (j2.7j) and (|2.8p have means and 
all covariances 0. (They are thus orthogonal in L 2 .) It follows immediately 
that 

Var(S„,i(t)) =nE(/'(f;l)) 2 = n Var(/(t; 1)) = nt(l - 1), (2.11) 
Var (5 ft)2 (t)) = (n - l)E(/'(t; l)/'(t; 2)) 2 = (n - l)(Vax(/(t; l))) 2 

= (n- l)i 2 (l -t) 2 , (2.12) 
Cov(S n ,i(t),Sn,2(t)) =0. (2.13) 



3. Exact results 

We first give some exact results for finite n. It is easy to find the exact 
distribution of X„ )m for given n and m, see for example Stevens [22] or 
Mood [21]. For m > 1 and > 1 we have X„ jm = k if there are runs of 
l's separated by k — 1 runs of 0's and possibly preceded and/or succeeded 
by additional runs of 0's. Considering the bivariate generating function for 
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such sequences of arbitrary length, we easily find 

r{Xn „ = k) = , I - I r-](_£_)*(_i_) '- 1 ( r l-) 2 

= [x m - k y n ~ m - k+1 }(l - x)- k {l - y)- k - 1 
'm — 1\ /n — m + 1 N 

The mean can be computed from this [21], pQ, but simpler from (jl.lj) : 

n-l 

E^n.m = m- y^^{l n ,m{k)In,m{k + 1)) 
fc=l 

. m{m — 1) m(n — m + 1) , 

= m - n - 1)-) = 1 3.1 

n(n — 1) n 

A similar computation of the variance yields, omitting the details, 

m(m — l)(n — m){n — m + 1) 

Var A n m = — . 

n z (n — 1) 

If we instead randomize the insertion times as in Section [2] and consider 
the process at a fixed time t, we have by (|2.3j) . (|2.5p and the independence 
of I(t; k) for fc = 1, . . . , n, 

n-l 

EX n {t) = t + ^(1 - t)t = nt(l -t) + t 2 . (3.2) 
fc=i 

Similarly, using (|2.2p . again omitting details, 

VarX n (t) =nt(l-t)(l-3t + 3t 2 )+t 2 (l-t)(3-5t). (3.3) 

To find the exact distribution of X* seems much more complicated. Exact 
values of P(A"* = h) are easily calculated for small n, see af Hallstrom [I], 
but we do not know any general formula. It would be interesting to find 
such a formula by combinatorial methods. 

4. The asymptotic distribution of S n ,i(t) and S n ^{t) 

To state our results on the asymptotic distribution of the stochastic pro- 
cesses S nt i(t) and S nj 2(t), we need a suitable topological space of functions. 
We use, for an interval / CI, the standard space D(I) of right-continuous 
functions on / that have left-hand limits, equipped with the Skorohod topol- 
ogy. For a precise definition of this (metrizable) topology, see e.g. Billingsley 
[!](/= [0,1]), Jacod and Shiryaev [12] (I = [0,oo)), Kallenberg pH Ap- 
pendix A. 2] (/ = [0, oo)), or Janson [14]. For our purposes it is sufficient to 
know that if / is continuous on I, then f n — > f in D{I) if and only if f n — > f 
uniformly on every compact subinterval. In particular, if / is compact, for 
example I = [0, 1], and / is continuous on /, then f n — > / in D(I) if and 
only if /„ — > f uniformly. 
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Our main result on the asymptotic global behaviour of S Ut i(t) and S ni 2(t) 
then can be stated as follows. 

Theorem 4.1. As n — » oo, in D[0, 1], 

n- 1/2 5„,i(t) Zx(t), (4.1) 

n- 1/2 5 n , 2 (t) Z 2 (t), (4.2) 

jointly, where Z\ and Z 2 are two independent continuous Gaussian processes 
on [0, 1] with means EZi(i) = EZ 2 (t) = and covariances 

E(Zi(s)Z 1 (t)) = s(l-t), 0<s<t<l, (4.3) 

E(Z 2 (s)Z 2 (t)) =s 2 (l-t) 2 , 0<s<t<l. (4.4) 

Thus, Z\ is a standard Brownian bridge, and the limit (14. 1|) is just the 
well-known theorem that the empirical distribution function asymptotically 
is distributed as a Brownian bridge, see e.g. Billingsley [U Theorem 16.4]. 

The proof of Theorem 14. 1\ and of all other results in this section, are 
postponed to Section [H 

Using (|2.10p , Theorem 14.11 yields the asymptotic distribution of the pro- 
cess X n (t). 

Corollary 4.2. As n —* oo, in D[0, 1], 

n- l ' 2 (X n {t) - nt(l - t)) Z(t), (4.5) 

where Z is a continuous Gaussian process on [0, 1] with mean E Z(t) = 
and covariances, for < s < t < 1, 

E(Z(s)Z(t)) = s(l - 2s)(l - t)(l - 2t) + s 2 (l - t) 2 (4.6) 
= s(l-t)(l-s-2t + 3st). (4.7) 

In particular, this implies the limit (14. 5h for each fixed i, with Var(Z(i)) = 
t(l - t)(l - 3t + 3t 2 ), which also follows more easily from ([22]), ([3^]) . ([33]1 
and the Central Limit Theorem for 1-dependent sequences. 

These results are stated using the randomized insertions described in Sec- 
tion [21 We can also return to the original deterministic insertion times and 
obtain asymptotics of the discrete process (^n,m)m=0' wri ich yields Theo- 
rem 11.51 stated in the introduction. Note that the limit processes in Theo- 
rem [L5] and Corollary 14.21 are different, due to the additional random vari- 
ation introduced when randomizing the time. (The variance of the limit in 
Theorem 11.51 is strictly smaller than in Corollary 14.21 at every t £ {0, 5, 1}-) 

We will also need a moment estimate. It is easy to see that n 1 ^ 2 S n ^{t) has 
moments that are bounded as n — > 00, for every fixed t £ [0, 1]. We extend 
that to the supremum over all t. 

Theorem 4.3. Let S * i := sup 0<t<1 \S nt i(t)\ for i = 1,2. Then, for each 
fixed r > 0, E(S*_ i ) r = 0(W 2 ). 
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We are primarily interested in the maximum X* of X n (t). It is evident 
from Corollary 14.21 that the maximum is attained close to the maximum 
point of t{l — t), i.e., close to t = 1/2. We use a magnifying glass and study 
the processes close to t = 1/2 in greater detail. The correct scaling turns 
out to be t = | + xn -1 / 3 , and we have the following asymptotic behaviour 
on that scale. 

Theorem 4.4. As n — > oo ; in D(—oo, oo), 

n- 1/3 (5 n)1 (| + xn- 1 / 3 ) - S n>1 (i)) -±> B 1 (x), 
™- 1/3 (S n , 2 (i + zn- 1 / 3 ) _ 5 n , 2 (i)) 2- 1 / 2 i? 2 (x), 

jointly, where B\ and B2 are two independent Brownian motions on ( 
Furthermore, for any fixed A < 00 and i = 1,2, 

Emax (S n ,4 + xtT 1 ' 3 ) - S n ^\)f = 0{n 2 ' 3 ). 

Corollary 4.5. ^4s n — > 00, in D(—oo, 00), 

n-^(X n (l + xn- 1 ' 3 ) - *„(!)) 2- 1 /2 jB(x) _ 
where B is a Brownian motion on (—00, 00). 

5. TIME-REVERSAL 

In the proofs below, we will introduce factors that blow up at the endpoint 
t = 1. To see that there is no real problem at this endpoint, we will use 
a time reversal trick which enables us to transfer results from the other 
endpoint. 

If we replace each by 1 — T^, which of course has the same distribution, 
then I(t; k) becomes 1 — 1(1 — t;k), except at the jump point, and thus, see 
(j23D - (|T8D . I'(t; k) becomes -I'(l-t; k) and SW.iO) becomes (-1)^^(1-*), 
again excepting the jump points. To be precise, let for a function / on [0, 1], 
f(t—) := lim s ft /(s) (when this exists), with /(0— ) := /(0). Then S n> i(t) 
becomes (— iyS n< i((l — t)—) under this time-reversal, and thus 

S n)i (t)^(-lfS nii ((l-t)-), (5.1) 
as functions in D[0, 1] and jointly for i = 1, 2. 

6. Proofs 

The proofs are based on martingale theory, in particular a continuous 
time martingale limit theorem by Jacod and Shiryaev |12j . We will use 
the quadratic variation [X, X]t of a martingale X (in continuous time) and 
its bilinear extension [X, Y]t to two martingales X and Y. For a general 
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definition see e.g. [12]; for us it will suffice to know that, if X and Y are 
martingales of pathwise finite variation, then 

[X,Y] t = AX( S )AF( S ), (6.1) 

0<s<t 

where AX(s) := X(s) — X(s-) is the jump of X at s and, similarly, 
AY(s) := Y(s) — Y(s-). The sum in (|6.ip is formally uncountable, but 
in reality countable since there is only a countable number of jumps; in the 
applications below, the sum will be finite. 

A real-valued martingale X(s) on [0, t] is an L 2 -martingale if and only if 
E[X,X] t < oo and E|X(0)| 2 < oo, and then 

E\X(t)\ 2 = E[X,X] t + E\X(0)\ 2 . (6.2) 

We will use the following general result based on pJ2] ; see [15} Proposition 
9.1] for a detailed proof (for I = [0, oo); the general case is the same). (See 
also [13] and [14] for similar versions). 

Proposition 6.1. Let I = [a,b] or I = [a,b), with — oo < a < b < oo. 

Assume that for each n, M n (t) = (A^m(t))| =1 is a q- dimensional martingale 
on I with M n (a) = 0, and that £(i) = (Sjj(t))? J=1 is a (non-random) 
continuous matrix-valued function on I such that for every fixed t £ I and 
i,3 = 1, • • • ,Q, 

[Mni,M n j]t ^ij(t) qs n > oo, (6.3) 
supE[M ni , M ni } t < oo. (6.4) 



Then M. n — > M.00 as n — > 00, in D(I), where M.00 = (Mooi)1 = i 



is a 



continuous q-dimensional Gaussian martingale with E_Moo(i) = and co- 
variances 

E(M ooi (s)M ODj (t)) = Ey(«Ai), s,t£l. 
In other words, the components M n i(t) converge jointly to Mooi{t) in D(I). 

Remark 6.2. By ([672]), ([63)) is equivalent to sxxp n E\M n (t)\ 2 < 00, the 
form used in e.g. [15]. 



Proof of Theorem \4-l\ We first construct martingales from 5 nj i(i) and S n: 2(t). 
We define, for < t < 1, 



I(t;k) 



I'(t;k) Jl, I(t]k) = l, 



l-t [-t/(l-t), I(t;k) = Q; 

n 

S nA (t) :=Y,T(t;k) = (l-ty'Sn^t); (6.5) 



k=l 
n-1 



S n , 2 (t) := ^ k + !) = (!- t)- 2 S n , 2 (t). (6.6) 



k=l 
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We have E T(t; k) = and 
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E(j(f; kf) = Var (l(t; k)) = (1 - t)~ 2 Var(l(f; fc)) = — -. (6.7) 



It is easily checked that each I(t; k) is a martingale on [0, 1) [144 Lemma 
2.1]; since these martingales for different k are independent, the products 
I(t; k)I(t; k + 1) are martingales too, and thus S nj i(t) and S n> 2(t) are mar- 
tingales on [0,1) with 5 n) i(0) = £71,2(0) = 0. To calculate their quadratic 
variations and covariation, note that AI(t; k) = (1 — i) _1 when t = T^ and 
otherwise. Further, with J(t; 0) := I(t; n + 1) := 0, 



A5 ft) i(t) = ^A/(i;A;), 

fc=i 

n 

AS n , 2 (t) = ^ A J(t; k) (T(t; k - 1) + /(t; fc + 1)) , 



k=l 



and thus 



n n * 

[S n ,i, S nA ] t = £ £| AI(.; A;)| 2 = £ l[r fc < t], 

s<t fe=l fe=l ^ fc ^ 

n 

&,i> S nt2 ] t = £ £| AJ(s; A;)| 2 (/( S ; fc - 1) + /(a; k + 1)) 



s<t fc=l 



n 

[S n , 2 , S n ,2]t = El A ^ k )\ 2 (^ T(s; k + l)f 



s<t k=l 



(6.8 



n 1 

E (1 _ r)2 ( J (^; * - 1) + J(T fc ; A; + l))l[T fc < t], (6.9) 



n ^ 

E (1 _ rj , )2 (^ fc - !) + ? ( r ^ fc + i)) 21 ^ ^ *]■ ( 6 - 10 ) 
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Hence, since the Tk are independent and uniformly distributed on [0,1 
and using (|6.7p . 

t 



- - C* ds 
E[5„,i, S n ,i]t = n T = n 



1 



1 - s 



o 11 



" /"* ds 

E[5 n ,i, S„, 2 ] t = V / ™ E(/(s; fc - 1) + /(a; fe + 1)) = 0, (6.12) 

E[5 n , 2) S„, 2 ]t = V / -— ^ E(/(s; fc - 1) + /(a; k + l)) 2 



((n- 2) -2 + 2-1) / 
./o 



ds s , t 2 
(n- l)- 



(l-s) 2 l-s v '(l-t) 2 

(6.13) 



(Indeed, these formulas also follow directly from (|2.ll|) — (|2.l3|) by (|6.5p . (|6.6p 
and (|6.2p together with its polarized version for two martingales.) 

Moreover, the fc:th and Z:th terms in the sums in (|6.8p - (|6.l0p are indepen- 
dent when \k — l\ > 2, and each term is 0((l — i)~ 2 ). Hence, for i, j G {l, 2}, 

V a r([S n>i ,S n ,j]t) = 0(n(l-t)' 4 ). (6.14) 
Define now, for i = l, 2 and < t < l, 

S n ,i(t) := n-V 2 ^-^)- (6-15) 
By dSHTJ)— (JSII3D, for every fixed t 6 [0, l), 

[S n ,l> Sn,l]t — ^ 7~7 > 



[5 n ,i, <Sn,2]t — : > 0, 

[S n .2, S n) 2\t — 



t 2 



(i-tr 

Proposition I6.ll thus applies with I = [0,1), with (|6.4p verified by (|6.15p . 
(|6.1ip and (|6.13j) . which shows that 



n 



- 1/2 S ntl (t) = S nti (t) -±> Zi(t), i = 1,2, (6.16) 



jointly in D[0, 1), where Z\(t) and Z 2 (i) are continuous Gaussian processes 
on [0, 1) with means and covariances, for < s < t < 1, 

E(Z 1 ( S )Z 1 (t)) = (6.17) 

E(Zi(a)Z 2 (t))=0, (6.18) 

E(Z 2 (s)Z 2 (i)) = 7J3^)2- ( 6 - 19 ) 
Note that (|6.18p implies that Z\ and Z 2 are independent. 
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We define Z t {t) := (1 - t)%(t) for t £ [0,1), and Z^l) := 0. Then 
([636]) implies, by (JO) and lpfT5|l . that (jH]) and flOJ) hold jointly in Z?[0, 1). 
Furthermore, (|6.17p - (|6.19p imply that Z\ and Z 2 have the covariances (|4.3p 
and (jOl) . 

It remains to extend this from [0,1) to [0,1]. We use the time-reversal 
trick in Section [5] and have by (|5.ip and the result just shown 

n-^S^dl - t)-) = (-lyn-^S^it) (-l)^(i) ^ Z<(t) 

in J3[0,l), and thus ra~ 1/2 S„,i(i) ^(l-t) in .0(0,1]. Clearly Z f -(l-t) = 
Z(i), as processes on [0,1]; since Z{ is continuous on [0,1), this implies 
continuity at 1 too, and thus Z% is continuous on [0,1]. We have shown that 
the limits flH} and ([42]) hold (jointly) in both D[0, 1) and -0(0,1], which 
easily implies that they hold in -D[0, 1] too, see e.g. [14} Lemma 2.3]. □ 



Proof of Corollary Immediate by Theorem 14.11 and (|2.10p . with the 
limit Z(t) : = (1 - 2t)Z 1 (t) - Z 2 {t), since n _1 / 2 i?n(t) -> uniformly. The 
covariances (|4.6p ~ (|4.7p of Z(t) follow from (|4.3p . (|4.4p and the independence 
of Z\ and Z 2 - □ 



Proof of Theorem By Holder's inequality, it suffices to prove the result 
when r is an even integer. Since S* t < sup < t <iy 2 \S n ,i{t)\+su.p 1 / 2 <t<i |<Sn,i(*)| 
and the time-reversal symmetry (I5.ip implies 



sup \S n<i (t)\ = sup \S n>i (t)\, (6.20) 

l/2<t<l 0<<<l/2 

it is sufficient to consider sup 0<t<1 / 2 \S n ,i{t)\- Moreover, |i? nj j(i)| < 1 5^,1^)1, 
and by Doob's maximal inequality for martingales, see e.g. Proposition 
7.16], 

E( sup \S nA {t)\) r < e( sup S n>i (t) Y < C r ^( S n ,i{\) Y, (6.21) 

0<t<l/2 ' V 0<t<l/2 / \ / 

for some constant C r (= (r/(r — l)) r ). 

Finally, 5^,1 (5) is t ne sum °f n independent random variables I(^;k), 
each with values ±1 and mean 0, and it is easily verified that, with r = 2£, 

(\s n>i (±)\y = 0{n t ) = 0{n r / 2 ). (6.22) 

Similarly, S n ^{\) is the sum of the n — 1 random variables J(^; + 
1); these variables too have values ±1 and mean 0; moreover, it is easily 

verified that they too are independent. Hence S n>2 ,{\) = £^-1,1(5)) and 
(|6.22p implies the same estimate for 18^,2(5) too. 

The result follows by this, ([531]) and (16301) . □ 

Proof of Theorem We claim that Corollary 14.21 implies that 

n-^maxX^t) - in) -±> Z{\) := -Z 2 {\\ (6.23) 



E 
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which gives (II. 3p by (14. 4p . (We could use Theorem 11.51 instead.) The ar- 
gument was sketched in the introduction, and this is an application of |14} 
Theorem 16], but for completeness we give the details in our case. We may 
for simplicity use the Skorohod coupling theorem }16l Theorem 4.30], which 
says that we can assume that (|4.5p holds with convergence a.s. and not just 
in distribution. Thus, for (almost) every point in our probability space, 
n~ l < 2 (X n (i) — nt(l — t)) — * Z(t) in D[0, 1], which since Z(t) is continuous 
means uniform convergence on [0,1]. In other words, uniformly in t £ [0,1], 

X n {t) =nt(l-t) +n 1 l 2 Z(t) + oin 1 ' 2 ) = \n-n{\-t) 2 + n l / 2 Z(t) + o(n 1 / 2 ). 

(6.24) 

In particular, 

X* > X n {\) = \n + n l ' 2 Z(\) + o(n 1 / 2 ). (6.25) 
Conversely, (|6.24p yields for \t — ^| < ra -1 / 8 , since Z is continuous, 

X n (t) <\n + n x l 2 Z{t) + o{n 1 ' 2 ) = \n + n x l 2 Z{\) + o^ 1 / 2 ), (6.26) 
and for \t — l| > n -1 / 8 , since Z is bounded, 

X n (t) <\n- n 1 " 1 / 4 + 0(n l l 2 ) <\n + n x l 2 Z(\) (6.27) 
for large n. It follows from flE25]), <$§M§ and (^27]) that 
X* n = \n + n 1 / 2 Z{\) + o{ n y% 

and HEZ5H follows. 

To prove moment convergence in (jl.3p . it is, as is well-known, see e.g. 
|1H Theorems 5.4.2 and 5.5.9], enough to prove that for each fixed r > 0, 
the r:th absolute moment of the left hand side is bounded, asn-* oo. By 

4210]), 

I^C ~ \ n \ = | su P^n(i) — suprat(l — 1)\ < sup|X n (t) — nt(l — t)\ 

t t t 

and the required estimate follows by Theorem 14.31 □ 

Proof of Theorem \1.5l Recall the order statistics Tr n;m \ from Section[2l Since 
X n ,m = X n (T (n;m) ), we are studying the process X„ iLn4 j = X„(T( n .|_„ t j)). 
The idea of the proof is to use the functional limit results just shown and 
replace t by the random time T( n . i nt j \ . Note first that N n (T( n;m ) ) = m and 
thus 

JV n (r (n . LntJ) ) = \nt\ =nt + 0(1). (6.28) 

By (HZ]), 

sup \N n (Tr n . m )) / n - Tf n . m )\ < sup \N n (t)/n - t\ = sup In" 1 S n< i(t)\ 

0<m<n 0<t<l 0<t<l 

= n- 1 S* nA , (6-29) 
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which by (|4.1|) (or Theorem 14.31 or the Glivenko-Cantelli theorem |16^ 
Proposition 4.24]) tends to in probability. Thus, by (j6.28H . 

sup \t - Tr n .\ nt \) \ < sup \Nn(T( n -\nt\))/n - T (n;MJ)| + n' 1 0. (6.30) 
0<t<l 0<i<l 

The proof of Corollary 14.21 shows that (|4.5p holds jointly with (|4.ip and 
(jOD . with Z(t) = (1 - 2t)Z 1 (t) - Z 2 (t). Furthermore, by ([277|1 . N n (t)/n = 
t + S n x(t)/n, and a Taylor expansion of the function 1 1— > ni(l — t) yields 

JV n (t)(l - JVn(t)/n) = nt(l - t) + (1 - 2t)S nA (t) - S n>1 {tf/n. 
Consequently, by ([4TT]) . in D[0, 1], still jointly with (f4"3j) . 

n- 1 / 2 ^ n (t)(l-AT n (t)/ n ) _„t(i-t)) -1* (l-2t)Zi(t), 
and subtracting this from (14. 5j) yields 

n-V 2 ^ ft (t) - JV n (t)(l - N n {t)/n))) -±> Z{t) - (1 - 2t)Z 1 (t) = -Z 2 (t). 

(6.31) 

Because (I6.30P holds and Z 2 (t) is continuous, we may replace t by Tr n .\ nt \\ 
on the left hand side; for a formal verification of this we may again use the 
Skorohod coupling theorem [161 Theorem 4.30] and thus assume that (j6.30j) 
and (|6.3ip hold a.s., i.e. that the functions in (|6.30p and (|6.3ip converge 
uniformly on [0,1] to their limits. Consequently, 

n- l ' 2 (x n ^ - iV n (T (n . LntJ) )(l - 7V„(T (n;LntJ) )/n)) -±> -Z 2 (t), (6.32) 

which by (IQKD yields JT5]) with Z(t) = -Z 2 (t). □ 



The fact that the terms with S n ,i cancel in the proof above is no coinci- 
dence. S n7 i measures by (12. 7j) the random fluctuations introduced by used 
random insertion times Tj. , and it is very intuitive that this term will appear 
in the limits for X n (t) but not for X n>TO . A theorem verifying that this can- 
cellation happens in general in a situation closely related to the one studied 
here is given in [Til Theorem 7]. 

Proof of Theorem \4Jj\ Fix A > 0, and define for n > (2A) 3 and x G [0, 2A], 
W n4 {x) := S n4 (± + (x- A)^ 1 ' 3 ) - 5 n>< (i - An- 1 / 3 ). (6.33) 

Then W n i is a martingale on [0,2^4] with W n i(0) = 0, and its quadratic 
variation is by (|6.ip 

[W nj i,W n j] x = [S n> i, S n> i] l+( x —A) n -i/3 — [S n ,ii S nj i]i_ An -i/3- (6.34) 
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Hence, by (l6TTll - (|03j ). for < x < 2A, 

E[W n ,i, W nil ) x = n / -2 = n 2 / 3 x(4 + 0{n~ l l 3 )), (6.35) 

E[W n ,i,W n , 2 ]x = 0, (6.36) 
E[VFn,2, ^, 2 ], = (2n - 2) / = 2n 2 / 3 x(4 + O^- 1 / 3 )). 

Jl- An -l/3 (1 - S) d 

(6.37) 

Moreover, by (|6.34[) and (|6.14j) . for n > (4A) 3 , say, 

Var[W nii ,W nj -] a = 0(n). 

Consequently, Proposition ^. ll applies to rr x l 3 W n ^, and shows that in D[0, 2A] 
and jointly for i = 1, 2, 

d 



n~ 1/3 VMx) W(x), (6.38) 

where Wi and W 2 are independent Gaussian stochastic processes with means 
and 

E(Wi (a?) Wi (y))= 4x, E(W 2 (aj)W 2 (y)) = 8x, < x < y < 2A 

In other words, Wi(x) = 2£?i(x) and W%(x) = y/8B2(x), where B\ and 
B2 are independent Brownian motions on [0, 2A\. We may assume that B\ 
and B2 actually are independent two-sided Brownian motions defined on 

the entire real line. Note that Bi(x + A) — B{(A) = B{(x) (as processes on 
R). Hence we can make a translation and obtain from (|6.33p and (|6.38p . in 
D[— A, A] and jointly for % = 1, 2, 

n-Wfin^ + xn- 1 ' 3 ) -5 n ,i(i)) =n- 1 /\W n , i (x + A)-W n M)) 

2 ( <+ i)/2 + A) _ d 2 (i+i)/2 B . (x) _ (6 _ 39) 

By ()6.5p and (|6.6p we further have, uniformly for n > (4^4) 3 and x 6 

<Sn,i ( 2 xn ^ ) — 5n,j ( 2) = ( 2 — ^ ) (2 ^ ) ~~ (2) ' ^n,i (2) 

= 2- i (s n , i (l+xn- 1 / 3 ) -5 n>i (i)) + 0(n-V35V). 

(6.40) 

and thus, using (|6.39p and Theorem l4.31 in A] and jointly for i = 1,2, 

n- 1/3 (S n ,(l + xn" 1 / 3 ) _ fl^(D) 2-^ +1 )/ 2 J B 4 (x) = ^^B^x). 

Since convergence in D[— A, A] for every A > implies convergence in 
D(— 00,00), this proves (|4~8l) and (H~9l) . 
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For the second moment estimate (I4.10p . we first note that (I6.2p . (|6.35l) 
and (|6.37p show that, for each fixed A, 

E \W n ,i(2A)\ 2 = E[W nji , W n ,i} 2 A = 0(n 2 / 3 ), (6.41) 

and thus by Doob's maximal inequality [HI Proposition 7.16], 

E( max \W ni (x)\ 2 ) = 0(n 2/3 ). 

Hence, by f|6.33j) and translation again, 

Emaxj^^i + xn- 1 / 3 ) - S n ^)\ 2 = E max| W^Oz + A) - W n ,i(A)\ 2 

<4E max I W n i{x)\ 2 = 0(n 2/3 ), 

0<x<2A ] ' 1 

and dUD} follows by (JOU]) and Theorem E2 □ 

Remark 6.3. We can extend (|4.10p to arbitrary powers r > 0, with the 
estimate 0(n r / 3 ), by the argument above with the Burkholder-Davis-Gundy 
inequalities |16|, Theorem 26.12] replacing (|6.4ip ; we omit the details. 

To study X n (t) close to t = 1/2, we rewrite (12. 10f) as, for |x| < 1/2, 

X n {\ + x) = \n-nx 2 -2xS nA {\ + x) -S n , 2 {? + x) + R n {\ + x). (6.42) 

Hence, still for \x\ < 1/2, 

X n (\ +x)- X n (\) = -nx 2 - 2xS nA (\ +x)- {S n , 2 (± + x) - S n , 2 (%)) 

+ R n {± + x) - R n (±) (6.43) 

and thus, for \x\ < n 1 / 3 /2, recalling \R n (t)\ < 3, 

n-V3( Xn (i + xn"V3) _ Xn{ ^ = _ x 2 _ 2n -2/3 xSn ^i + xn -i/ 3) 

-n-V3( 5ni2 (i +xn -i/3) -5 n , 2 (i)) +0(n- 1 /3). ( 6 .44) 

Proof of Corollary gg| Fix ,4 > 0. Then (liTTil follows in A] by 

dS3U), (BSD and Theorem S3] (which implies n -2 / 3 ^ -^-> 0), with £(sc) := 
—B 2 (x). Since A > is arbitrary, this yields convergence in D(—oo, oo). □ 

Let i + :=iV 0. 

Lemma 6.4. Let x\ > and suppose that A4(x) is a martingale on [0,x\] 
with M(0) = such that for some constant K and all x £ [0, x\] 

VaiM(x) < Kx. 

Then, for every a > and xq £ (0, xi], 

E( max (Mix) - ax 2 )^) < — . 
Vio<a;<^i +/ clxq 
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Proof. We may for convenience extend M to a martingale on [0, oo) by 
letting M(x) := M(x\) for x > x\. Let y > and £ > 0. Then, by 
Kolmogorov-Doob's inequality \16\ Proposition 7.16], [11, Theorem 10.9.1], 

sup (M(x) - ax 2 ) >t) < P( sup M(x) > t + ay 2 

x€[y,2y] 1 K x<^,2y\ 

< EM(2y) 2 < 2ETy 



(t + ay 2 ) 2 " (t + ay 2 ) 2 ' 
Integrating with respect to t from to oo yields 

2Ky 2Ky 2K 



E( sup (A-l(s) - ax 2 ) ) < / 
xe[v,2vl •/ 



xe[j/,2j/] + Jo (t + ay 2 ) 2 ay 2 ay' 

and the result follows by summing over y = 2 k XQ, k = 0, 1, ... . □ 

Proof of Theorem \1.3[ We begin by showing that we can replace X n ^ n /2\ by 
X n (^) in the statement. By (|6.29p and Theorem 14.31 

n^N^T^i^/n - T (n;[n/21) | < n^Xi 
and thus by (|6T28j) 

^ 1/3 |r (n;rn / 2l) -i| -^0. (6.45) 
It now follows from Corollary 14.51 arguing as for (|6.32j) and using (|6.45p 
and the fact that the limit in (|4.1ip is continuous, that we can substitute 
x = n 1 ^(T( n .^ n /2\) — |) in (|4.11|) and obtain 

n- x / 3 (X nirn/2l - X n {\)) = n-^iXniT^/^) - X n (\)) ~^ 0. (6.46) 

Furthermore, by (|3.1|) and (|3.2|) . 

M nJn/2l = ±n + 0(l)=EX n (±)+0(l). (6.47) 

Hence, it is enough to prove Theorem 11.31 with X n r n /2] replaced by X n (|); 
we thus study 

M n := X* - X n (\) = max (X n (t) - X n (\)). (6.48) 

We would like to take the supremum over all real x in (|4.1ip . but that 
is not allowed without further arguments since the supremum is not a con- 
tinuous functional on D(—oo, oo) (the topology is too weak). We therefore 
fix a large A > and study the following five intervals separately (assuming 
n > {4Af): 



L 



I -i 
h 
h 
h 



[0, i] 



[i-yln-^i+An- 1 / 3 ], 
[i+An- 1 / 3 ,!], 

3 

■V 



^ 1]. 
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We denote further 

M nJ := max(X n (t) - X n {\)) + 
and have thus, since M n > 0, 

2 

M n = max M ni < M nj . (6.49) 

-2<i<2 ,J ^— ' ' J 
j=-2 

On Jo we use (|4.1ip . Since the maximum is a continuous functional on D(I) 
for any compact interval /, we obtain from (|4.11|) on D[— A, A] immediately 

n- 1/3 M n ^Uv A := max (2~ 1/2 B(x) - x 2 ) . (6.50) 

|x|<A 

Furthermore, it follows from (|6.44[) . Theorem 14.31 and (|4.10p that 

E(n- 1 / 3 M n , ) 2 < C(A), 

for some constant C(A) depending on A but not on n. Hence the random 
variables n~ 1//3 M nj o are uniformly integrable, and (|6.50p implies, see e.g. 
[TT1 Theorems 5.4.2 and 5.5.9], 

E(n~ 1 / 3 M ni0 ) ->EV A . (6.51) 

On I±2 we have by (|6.43p . with -r < \x\ < ^, 

X n (\ + x)-X n {\) < -n(i) 2 +5*! + 2^ + 6. 

We use the elementary inequality, for a > and i)£R, 

(a -6/2) 2 b 2 b 2 ,„ rn , 

-a + 6 = -^ — + —<—, 6.52 

a 4a 4a 



and obtain 

/ o* i o c* ^2 i o* \2 /■ o* 

i,2 



/ Q* I oc* "i2 / a* \2 I a* \2 

\ 71,1 H,2/ , ~ „ oWn,!/ 1 , „„Wn,: 



M n , ±2 < '-, "' + 6 < 8 ^ + 32 ^ + 6 

An /lb n n 



and thus by Theorem 14.31 

EM n , ±2 = 0(l). (6.53) 

For 7i we define 

^n(x) :=-\(S n ,2{\ + x)-S n ,2{\)); (6.54) 
this is a martingale on [0, 1/2). For < x < j, we have 

\S n ,2{\ + x) = (l-2x)- 2 S ny2 {\ + x) = (l + 0{x))S n>2 {± + x), 
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and thus, using (16.430 and (|6.52j) . for some constants Ci, C%, . . . , 

= -nx 2 -2xS r ^{\ + x)+U ri {x)+0{x)S n ^{\ + x)+0{l) 
< U n (x) - \nx 2 + S; hl + dS^ 2 - \nx 2 + 0(1) 

( a* \2 / c* \2 

<U n {x)-\nx 2 + C 2 -^— + Cz—^— + 0{l). (6.55) 
By d5U, dEMD and ([BTT31) we further have, for < x < 1/4, 

Var(f7 n (x)) = E[J7 n , C/J^ = ^ e([S„ )2 , 5„ )2 ] i +a . - [Sn,2, S n>2 ] i 

^ ■ ds < C^nx. 



16 7i (l-s) : 

2 v y 

Hence, Lemma l6~4l yields, for < x$ < \, 

E( max (f/ n (x) — hnx 2 ) ) < 



C 5 n C 5 

Taking xo = An~ l l^ we thus obtain from (|6.55p . using Theorem 14.31 again. 

EM n ,i < +0(1) = ^n x /3 + 0(1). (6.56) 

We obtain the same estimate for M n _\ by the time-reversal tt—^l — t and 

By (IQO]) and the estimates (IfOvfl) for M n ,± 2 and (I63HI) for M„,±i we find 
E \M n - M nfi \ < E M n ,_ 2 + E M n _i + E M n ,i + E M n , 2 < O e + C 7 n 1/3 A4, 
and thus 

limsupE|n~ 1/3 M„ - n _1/3 M ni o| < C 7 /A. (6.57) 
Now let A — > oo; then 

Vk -> ^oo := max(2 -1 / 2 .B(a;) - x 2 ) . (6.58) 

Note that, letting x = y/2, with V as in the statement of the theorem, 
Foo = max(2- 1 / 2 B(y/2) - (y/2) 2 ) = nuK^flfo) - iy 2 ) = \V. (6.59) 



It follows from (|6.57p and (|6.58p that we may let A — > oo in (|6.50p and 
obtain 

n -i/3 Mn VoD , (6.60) 

see [U Theorem 4.2] (we may change the notation and denote M n fl by M n -A 
for n > (4yl) 3 ; for smaller n we simply let M n -A = 0). 
Similarly, by (f63Tl) and ([Q7]l . 

lim sup El n~ 1 / 3 M n -EV^ I < Ot^" 1 + lEV^ -EV^I. 
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As A — ► oo, KVa, — ► EV^ by monotone convergence, and thus we obtain 
limsup n _ >00 E|n" 1 / 3 M n -EV^I = 0, i.e., 

n -1 / 3 EAf n -^EFoo. (6.61) 
The theorem follows by (16301) , $6M\i , $6Mb , (1Q6J) , flMED and (l639|) . □ 

7. Further results 

Consider Xnjn, the number of piles with a single exam (runs of length 
1) mentioned in Section [TJ If we for simplicity consider the cyclic case, see 
Remark 11.41 to avoid edge effects (these are O(l) only and do not affect the 
asymptotics), we have 

n 

= X> " W*0) W fc + !)(! " + 2)). 

fc=l 

After randomizing the time as in Section^ we get (with I(t; k+n) = I(t; k)) 

n 

x£\t)=Y,{l-I(t-k))l(t;k + l)(l-I(t;k + 2)) (7.1) 

k=l 

n 

= XX 1 - * - 7 '(*; (* + 7 '(*; fc + !))(!-*- + 2)) 

k=l 

= nt(l - tf + (1 - 3t)(l - t)5 nil (t) - 2(1 - t)£„, 2 (t) + tS£, 2 (t) + S n , 3 (t), 

where we now define S n ^ by summing to n in (|2.8p . and we introduce two 
new stochastic processes 

n 

S' n ^t):=J2l'(t;k)l'(t;k + 2), (7.2) 

fc=i 
n 

fc=l 

The proof of Theorem 14.11 extends to these and yields, in D[0, 1] and 
jointly with each other and (|4.ip and (|4.2[) . 

n- 1/2 5 n , 2 (t) Z£(i), (7.4) 

n- 1/2 5„, 3 (t) -±> Z 3 (i), (7.5) 

where Z 2 and Z3 are two continuous Gaussian processes on [0, 1] with means 
and covariances 

E(Z' 2 (s)Z' 2 {t)) = s 2 (l -tf, 0<*<t<l, (7.6) 

E(Z 3 (s)Z 3 (t)) = s 3 (l -t) 3 , 0<s<t<l. (7.7) 

Furthermore, all four processes Z\, Z 2 , Z' 2 and Z3 are independent. (Note 
that Z 2 and Z 2 have the same distribution but are independent.) 
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By the arguments in Section El which extend without any new difficulties, 
this yields the following results, corresponding to our results for X n ^ m and 

X n {t) in Sections □ and H We define X^* := x£\t), 
and note that Eln'(t) = nt(l — t) 2 has (on [0,1]) a unique maximum at 
t = 1/3. 

Theorem 7.1. As n — > oo, in D[0, 1], 
n-V*{xV{t)-nt{l-t?) 

-±> Z(t) := (1 - t)(l - 3t)Zi(t) - 2(1 - t)Z 2 (t) + t^(t) + Z 3 (t); 
Z is a continuous Gaussian process on [0, 1] with mean EZ(i) = 0. 
Theorem 7.2. As n — ► oo, m D[0, 1], 

where Z is a continuous Gaussian process on [0, 1] with mean E Z(i) = 0. 

We leave the explicit formulas for (co)variances in these theorems to the 
reader. 

Theorem 7.3. As n — > oo, 

n-^(x£>-±n)^N(0,Jf 9 ), 
with convergence of all moments. In particular, 

EX£> = ±n + o(ny% 
76 

VaxX<p* = —n + o(n). 
Theorem 7.4. As n — > oo, in D(—oo, oo), 

rr 1 ^ >( j + xn-V3) _ lW(i)) JL* y| fl(x) _ ^ 

where B is a Brownian motion on (—00,00). 
Theorem 7.5. ^4s n — * 00, 

n -v3 (x ci)*_ x w n/3j) ^ w 

where the random variable V is as in Theorem\TM and (3 := 2 7 /3 3 -8/3 5 2/3 = 
^(150) 1//3 . Furthermore, 

EX^> =ElJ' +/3EFn 1 / 3 + o(n 1/3 ) = |n + |3E7)i 1/3 + o(n 1/3 ). 

These results are easily extended to the number xj^m of piles with ex- 
actly d items (runs with exactly d l's) for any fixed d. We may also count 
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occurrences of any other fixed pattern, and more generally any functional of 
the type 

n 

X n , m ■= Yj^i^mik), In,m(k + £ ~ 1)) (7-8) 

k=l 

for some fixed £ > 1 and function ip : {0, 1}^ — > IEL We will pursue this in 
some detail, leave some other details to the reader, because the more general 
version illuminates the arguments above and the structure of our method. 
First, randomizing the time yields 

n 

X n (t):=J2^k(t), (7.9) 

k=l 

where we define *&(£) = ^{l{t; k),... ,I(t; k + I — 1)). We note that we 
will need more processes of the type S n j. It turns out that it is natural to 
use finite sequences of 0's and l's to index these processes; we thus change 
the notation and define a stochastic process S n - a (t) for each such sequence 
a = a\ ■ ■ ■ ct£ by 

n 

Sn;a(t):=Y, U ^k+j). (7.10) 

k=ije{i,...,t}, 

We thus now denote S n>1 (t), S nt2 (t), S' n2 (t), S n<3 (t) by S n; i(t), S n]U (t), 
S n ;ioi(t), S n -m(t). Initial and final 0's in a do not affect S n;a , so it is 
enough to consider a that begin and end with 1; let A be the set of all such 
strings a. 

Let £(a) denote the length of a and v{p) the number of l's in a, and 
consider only n > 2£(a). Then the terms in the sum in (I7,10p are orthogonal 

and we obtain E(S' n;a (t)) 2 = n(t(l — i))"^ . Moreover, S n;a (t) := (1 — 
£)~K a ) S n - a (t) is a martingale on [0, 1), and the proof of Theorem 14.11 extends 
immediately and shows that, in D[0, 1] and jointly for all a £ A, 

n-^ 2 S n . a (t) -i+ Z a (t), (7.11) 

where Z a , a S A, are independent continuous Gaussian processes with 
means and covariances 

E(Z a {s)Z a (t)) =s" (a \l-t) uia \ 0<s<t<l. (7.12) 

Furthermore, the estimate Theorem 14.31 extends to every S n - a (with the 
implicit constant possibly depending on a). 

A functional X n)Tn of the type (|7.8p yields after randomizing the time the 
functional X n (t) in (|7.9p . which always can be expanded as a finite sum 
(with orthogonal terms) 

X n (t) =g (t)n+ 9<x(t)S n -At), (7.13) 

aeA, e(a)<e 
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for some polynomials go(t) and g a (t), a £ A; this is seen by the same 
argument as in [14} Proposition 4.1]. Note that, for any n > 21 and any k, 

fld(0 = ^^=E **(<)• (7-14) 
n 

It follows from (I7TTI) and (1713!) that, cf. Corollary gjU in D[0, 1], 

n- x / 2 (X„(t) - ng (t)) -±> Z(t) := ]T <? Q (t)Z Q (i), (7.15) 

which is a continuous Gaussian process with mean and covariance function 
E(Z(s)Z(t)) = a(s,t) :=Y,9a(s)g a (t)(sAtr ( ~ a \l-sVty^. (7.16) 

a 

We have moment convergence in fj7. 1 5H : moreover, the variance of n -1 / 2 X n (t) 
is independent oi n >2£ and we have, for any k and n > 2£, 

e-i 

a(s,t) = n' 1 Cov(X n (s),X n (t)) = ^ Cov(* fc (s), V k+j (t)) . (7.17) 

j=-(<-i) 

Similarly, arguing as in the proof of Theorem 11.51 and observing that the 
S n -i terms cancel because gi(t) = g'o{t), we see that, in D[Q, 1], 

n-V 2 (X nt[nt] - ng (t)) -±> Z\t) := ^ g a (t)Z a (t), (7.18) 

another continuous Gaussian process with mean 0. 

Now suppose that go(t) has a unique maximum on [0, 1] at an interior 
point to, with #q(£o) < 0- Then all remaining proofs in Section [6] extend too 
without difficulties. In particular, if we define X* := max m X„ jm , we have 
the following generalization of Theorem 11.21 

Theorem 7.6. As n —* oo, 

n- 1 / 2 ^*- g(t )n) ^ N(0,a 2 ), 
with convergence of all moments, with, see (17.161) . 

a 2 := a(t ,t ) = 9a(t ) 2 (t (l - t )) u{a) . 
a 

Furthermore, cf. Theorem 14.41 in D(—oo, oo) and jointly for all a, 

n- 1/3 {S n;a (t + xn" 1 / 3 ) - S n;a (t )) -±> a a B a (x), (7.19) 

where a 2 := v(a)(to(l — to)) ^ 1 and B a , a € A, are independent Brown- 
ian motions on (— oo, oo). As a consequence, cf. Corollary |4.51 in D(— oo, oo), 

n- l l\X n (t + xn- 1 / 3 ) - X n (t )) -±> a*B(x) - \\g' \t Q )\x 2 , (7.20) 
where B is a Brownian motion on (—00,00) and 

o-t ■= Y,9a(to) 2 o- 2 Q = Y,9M 2 u(a){t {\ - to))"™- 1 . (7.21) 
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Finally, substituting x = (c*/|<7o (to)\) 2 ^ 3 y, we see that sup a . eK (cr*B(x) — 
±\g' \t )\x 2 ) ±PV, with 

/? := (4/Wo(to)\) 1/3 = K 2 ) 2/3 |^'(to)r 1/3 , (7.22) 
and we obtain the following, where X n (to) may be replaced by X n:iriQ , where 
either uiq := |_^o n J or m is chosen in {0, ... ,n} to maximize El n)m[) . 

Theorem 7.7. ^4s n —>■ oo, 

n-V 3 (X*-X n (t )) -±+pV, (7.23) 

where the random variable V is as in Theorem M.'A and (3 is given by (|7.22|) 
and (|7.2ip . Furthermore, 

EX* = EX(t ) +(3EVn 1/3 + o(n 1/3 ) = g {t )n + (3E Vn 1/3 + o(n 1/3 ). 

For calculation of the asymptotic variances a 2 and above, the given 
formulas using the coefficients g a (to) in the decomposition (|7.13p are often 
not very convenient. For a 2 , it is usually simpler to use (|7.17p with s = t = 
to- 

For a 2 we first observe that if we take the difference of the left derivative 
of a(s, t) with respect to s and the right derivative with respect to t at (to, to) 
(thus considering s < t only), we obtain from (|7.16p and (|7.2ip easily 



d 



d 



ai = —a(s,t ) -—a(t ,t) , (7.24) 

c/s s=t - at t=to+ 

a formula given by Daniels [6] (in a slightly different setting). It follows by 
the mean value theorem and (|7.17p that, for any fixed n > 2£, 

al = lim - (a{t + h,t + h)- 2a(t ,t + h) + a(t ,t )) (7.25) 

= lim — Var(X n (to + h) - X n (t )). (7.26) 

For fixed n, the probability that exactly one I(t; k) changes from to I in 
the interval [to, to + h] is nh + 0(h 2 ) and the probability that more than one 
will change is 0{h 2 ). Hence, if AkX n (t) is the function of {I(t; fc+i)}i<U|<^ 
that gives the jump in X n (t) (for n > 21) if I(t; k) is changed from to 1, 
keeping all other indicators fixed, then (|7.26p implies that, for any k and 
n > 2t, 



a 



2 



Var(A fc X n (to))- (7.27) 
For example, for the number of runs, 

A k X n (t) = (1 - I(t; k-1))- I(t; k + l) 

and a 2 = Var(/(£o; k — 1) + I (to; k + l)) = 2*o(l — to) = |j m accordance 
with Corollary 14.51 For runs of length 1 we similarly get, from (|7.ip . 

A k X^ (t) = - (1 - J(t; k - 2))l(t; k-1) 

+ (I - I(t; k - 1)) (1 - I(t; k)) - I(t; k + l)(l - I(t; k + 2)) 
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and (|7.27p yields, in accordance with Theorem 17.41 



a 2 = Var(A fe X«(i )) 



80 
81 



More generally, for Xn,m (runs of length exactly d), we find from (|7.14p . 



(|7.17p and (|7.27p . after straightforward calculations, go(t) = t d (l — t) 2 , to = 
d/{d + 2), g»(to) = -2d d "V(d + 2) d ~\ and 



For priority queues, the 2n events Ai and Di come in random order, with 
the restriction that A4 comes before Di for each i. Since only the order 
of the events matters, we may randomize the times as in Section [2] and 
assume that the times Ai and Di, i = l,...,n, are independent random 
variables uniformly distributed on (0, 1), conditioned on Ai < Di for all i. 
For two independent random variables T, T ~ [7(0,1), the distribution of 
(T, T) conditioned on T < T equals the distribution of (T A T, T V T), and 
this randomization of the times in a priority queue thus gives exactly the 
model for lazy hashing defined in Section [T] as found by Kenyon and Vitter 

[17] . In particular, max t Y n (t) = Y*. 

In analogy with the definitions in Section [21 we now let 




8. Priority queues and lazy hashing 



I(t;k) 
I'(t;k) 



l[T k < t], 
I(t;k)-t, 



I{t;k) := l[T k <t], 
I'(t; k) := I(t; k) - t, 



n n 



S n ,i(t) 



k=i fe=i 



n 



s n , 2 {t) 



Y,l'(t;k)f'(t;k). 



k=l 
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We further let N n (t) be the number of events (A k or D k ) up to t. Then, cf. 

fl23op ; 

n n 

N n (t) = I(t; k) + Y, k) = S n ,i(t) + 2nt, (8.1) 
fc=i fc=i 

n n 

Y n (t) = U A k <t<D k } = J2{l[T k <t<f k } + l[f k < t < T k ]) 

k=l k=l 

n 

= ^(l(t; k)(l - I(t; fc)) + I(t; k)(l- I(t- fc))) 

k=l 

= 2nt{\ - t) + (1 - 2t)S n ,i(t) - 2S n , 2 (t). (8.2) 

We introduce martingales S nt i and S n) 2 as above by (|6.5p and f)6.6|) . 

All proofs in Section[6]now go through with no or minor changes; the main 
differences are that f)8. If) and (|8.2p contain some factors 2 not appearing in 
(|2.9p and (|2.10p and that there will be a factor 2 on the right hand side of 
(|6.11j) ; thus Theorem 14.11 holds with the difference that (14. 3p is replaced by 

E(Z 1 (s)Zi(t)) = 2s(l-f), 0<s<i<l; 

similarly, (fO|) holds with Bi(x) replaced by 2 l / 2 Bi{x) in P~8j) . This yields 
the following results, corresponding to our results for J n>m and X n (t) in 
Sections [T] and HI 

Theorem 8.1. As n — ► oo, m L>[0, 1], 

n _1 / 2 (y n (t) - 2ni(l - *)) Z(t) := (1 - 2t)Z l {t) - 2Z 2 (t), 

where Z is a continuous Gaussian process on [0, 1] with mean E Z(t) = 
and covariances, for < s < i < 1, 

E(Z(s)Z(i)) = 2s(l - 2s)(l - t)(l - 2i) + 4s 2 (l - t) 2 
= 2a(l - t) - 4s(l - s)t(l - t). 

Theorem 8.2. As n — > oo, m -D[0, 1], 

n- 1/2 (y„, L 2n 4 j - 2nt(l - t)) Z(t) := -2Z 2 (t), 

where Z is a continuous Gaussian process on [0, 1] with mean E Z(t) = 
and covariances 

E(Z(s)Z(t)) = 4s 2 (l -t) 2 , 0<s<t<l. 
Theorem 8.3. ^4s n — > oo ; m D(—oo, oo), 

„-V»(y B (l + xn"^) _ y B (i)) a^Sfx) - 2x 2 , 
where B is a Brownian motion on (—00,00). 
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Theorem 8.4. As n — > oo, 

with convergence of all moments. In particular, 

EY* = n/2 + o{n 1 ' 2 ), 
Vary n * = n/4 + o(n). 

Theorem 8.5. As n — > oo, 

n- 1/3 (^-yn,n)^^, 

where the random variable V is as in Theorem \1.S\ and 

EY* =EY n , n + EVn 1/3 + o(n 1/3 ) = \n + E Vn 1/3 + o(n 1/3 ). 

Theorem 18.11 is given by Louchard |19j . Theorem 18,21 by Louchard [18J 
(with a deterministic change of time, making the problem equivalent to a 
queueing problem), and Theorems 18.41 and 18.51 by Louchard, Kenyon and 
Schott [20] (with different proofs). 

Note that in the proof of Theorem 18.21 the terms with S n i cancel, as 
discussed for Theorem 11.51 above. In both theorems the limit is thus given 
by Z2, which explains why we obtain exactly the same covariances in the 
two theorems, except for a normalization factor. (Unlike Corollary 14.21 and 
Theorem 18. 1|. where the variances of the limits are t(l-t)(l-3t + 3t 2 ) and 
2t(l-t)(l-2t + 2t 2 ).) 

Acknowledgement. I thank Goran Hognas for telling me about this prob- 
lem and providing me with the reference pQ . I also thank Anders Martin-L6f 
and Guy Louchard for interesting discussions. 
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